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ABSTRACT 

The present study validates a new procedure that combines continuous measures of 
proximity (Ubisense) and vocalization (LENA) into measures of peer social interaction. 
The data were collected from 4 boys and 5 girls (ages 2-3 at the outset) on 8 separate 
days (3-4 hours per day) over the course of an academic year. Teacher reports of 
friendship were positively correlated with continuous measures of dyadic social 
interaction (i.e, the amount of time two children spent in proximity to one another, 
talking). Self-reports of reciprocated friendship were marginally correlated with 
continuous measures of dyadic social interaction, but only in the spring semester 
(when children were older and their reports of friendship more reliable). At the 
individual level, peer nominations of likeability, and teacher ratings of sociability and 
withdrawal were correlated with continuous measures of social interaction (i.e, the 
amount of time a child spent in proximity to other children, talking). 
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Some of the first empirical studies of child development involved collecting 
in vivo data to measure the affiliations of preschool children (Goodenough & 
Anderson, 1931). Little has changed in the intervening years. Expert observa- 
tions remain the gold standard, underpinning the validity of child and 
teacher reports. It is easy to understand why scholars value expert observa- 
tions of affiliation: They are both accurate and descriptive. New technologies 
are available that build on these advantages, providing continuous, simulta- 
neous data on all children in a classroom. The goal of the present study is to 
validate a new procedure for assessing peer affiliation, one that integrates 
data obtained from continuous movement tracking (Ubisense: Killijian, 
Pasqua, Roy, Trédan, & Zanon, 2016) and continuous vocalization recordings 
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(LENA: Sangwan, Hansen, Irvin, Crutchfield, & Greenwood, 2015). We illustrate 
with data from an inclusive oral language preschool classroom for children 
with and without hearing loss. 

Friendships matter, even to very young children. Preschool children with 
friends are more socially competent and accepted than children without friends 
(Vaughn, Colvin, Azria, Caya, & Krzysik, 2001). Expert observations indicate that 
young children spend more time with those they reciprocally nominate as 
friends than with nonfriends, and more time with nonfriends than with class- 
mates they do not like (Hartup, Laursen, Stewart, & Eastenson, 1988; Masters & 
Furman, 1981). Similar findings emerge from teacher reports of friendship 
(McCandless & Marshall, 1957), which correlate with peer reports (Shin, Kim, 
Goetz, & Vaughn, 2014). Teacher and child reports also agree on the extent to 
which children are well liked by peers; expert observations corroborate these 
reports and suggest that well-liked children may enjoy status because they 
verbalize more during play than other children (Dunnington, 1957a, 1957b). 

The primary drawback of expert observations is their impracticality: They 
are cumbersome, expensive, and time-consuming; typically, only one child 
can be observed at a time, for a small window of time. Child reports have 
their own challenges: Participants, especially young ones, are often unco- 
operative or distracted, and the data collection process is intrusive. Scholars 
are hesitant to deploy friendship nominations with children younger than 3 
because they tend to be unreliable (Hartup, Glazer, & Charlesworth, 1967). 
Teacher reports are not typically used as a primary source of affiliation data 
because they can be confounded with perceptions of child characteristics 
and they may not be responsive to subtle shifts in patterns of affiliation. 
Still, given the challenge and expense of collecting expert observational 
data and the lack of available alternatives, most scholars are willing to 
tolerate the error inherent in child and teacher reports. 

We focus on two technological advances that provide an alternative to 
expert observations. Ubisense, a commercially available radio frequency 
tracking system, can be used gauge affiliation through measures of proxi- 
mity and mutual orientation (Messinger et al., 2019). LENA, a commercially 
available system of measuring child speech production, can be used to 
gauge social interaction through measures of expressive language (Perry 
et al., 2018). Using a different sample of children, we recently combined 
these two tools, demonstrating how they can be used to create a social 
contact score for each child in a preschool classroom (Perry et al., 2020). 
Here, we describe how these new forms of data converge with traditional 
child and teacher reports of peer affiliation. 
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Method 
Participants 


The participants included 9 children (4 boys, 5 girls) between the ages of 2 
and 3 at the outset (Mage = 30.33 months, SD = 2.45). Participants were 
drawn from an English-dominant oral language inclusion classroom, 
located at a university-affiliated school in Miami, USA. Participants included 
7 children who use hearing aids or cochlear implants and 2 typically hearing 
children. The participants included 2 African Americans, 3 Anglo Americans, 
3 Hispanic Americans, and 1 child with a mixed ethnic background. 


Procedure 


All children in the class participated. Written consent was obtained from 
parents and from teachers (n = 2 at the outset, then n = 3 from the end of 
the fall semester onward). Peer nomination data were collected by 
a trained research assistant in a quiet classroom setting. Teachers com- 
pleted questionnaires after class. Child and teacher reports of friendship 
were collected the same week as the continuous data, four times in the fall 
of 2017 (October-December) and three times in the spring of 2018 
(January-March). Peer and teacher reports of social engagement were 
collected at the beginning and end of each semester. Continuous measures 
of social interaction were collected one day a week approximately every 
other week in the fall semester and approximately every third week in the 
spring semester. Each observation was 3-4 hours in length. Data were 
collected indoors, in a 7.56m x 4.78m classroom. Stability coefficients for 
each variable are given in terms of coefficient r, accompanied by confi- 
dence intervals and p-values (both one-tailed to reflect directional tests). 


Measures 


Continuous measures of social interaction (Ubisense and LENA) 

Two objective measures of affiliation were integrated. Continuous mea- 
surements of child location were collected using the Ubisense 
Dimension4 system. Audio was collected using LENA Digital Language 
Processors. Each participant wore a vest (see Appendix) with two tags 
housed in the left and right rear pockets to provide information about 
orientation. LENA audio recorders were secured in the front pocket. The 
Ubisense system tracked each child’s location to an accuracy of 15 cm 
using ultra-wide radio frequency identification. Audio files were 
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analysed using LENA Pro V3.4.0 pattern recognition software. We define 
social interaction as a vocalization made when a child was within 1.5 
metres of another child, where both children were oriented (+45° 
degrees) towards one another. We created two measures of social 
interaction; z standardized to account for differences in the amount of 
time spent in class. The dyadic continuous measure of social interaction 
represents the number of vocalizations made when a child was in the 
presence of another child, divided by the amount of time the two 
children were in the classroom together (see Perry et al., 2020). 
A dyadic score was calculated for each pair of children in the classroom 
(N = 36), by averaging the social interaction scores of one child when in 
the presence of the other child (e.g., the percentage of time child 
A vocalized in the presence of child B plus the percentage of time 
child B vocalized in the presence of child A, divided by 2). Scores were 
z-standardized within waves, then averaged across waves. Dyadic scores 
were stable from the beginning to the end of each semester (fall r = .71 
[95% Cl: .50, .84], p < .001; spring r = .55 [95% Cl: .27, .74], p < .001), 
between assessment periods within semesters (fall range: r = .76 to .91 
[M = .85]; spring range: r = .10 to .65 [M = .43]), and between the mean 
of the fall and the mean of the spring semesters (r = .82 [95% Cl: .65, .91], 
p < .001). The individual continuous measure of social interaction repre- 
sents the number of vocalizations made by a child in the presence of 
any other child in the classroom, divided by the amount of time the 
child was in class. An individual score was calculated for each child in 
the class (N = 9). Scores were z-standardized within waves, then aver- 
aged across waves. Individual scores were stable from the beginning to 
the end of the fall semester (r = .91 [95% Cl: .55, .98], p = .001), between 
assessment periods within semesters (fall range: r = .89 to .95 [M = .92]; 
spring range: r = .12 to .77 [M = .45]), and between the mean of the fall 
semester and the mean of the spring semester (r = .89 [95% Cl: .55, .98], 
p = .002). Stability coefficients from the beginning to the end of the 
spring semester were substantial, but only marginally significant (r = .58 
[95% Cl: —.28, .86], p = .10). 


Friendship: Teacher reports 

Teachers identified all friendship dyads on a grid that listed each child in 
the class across the top and the side: ‘Friends are children who like one 
another and who play together.’ Teacher reports of friendship were stable 
from the beginning to the end of each semester (fall r = .32 [95% Cl: —.01, 
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59], p = .03; spring r= .37 [95% Cl: .05, .62], p = .01) between assessment 
periods within semesters (range r= .32 to .45 [M = .39]), and across the fall 
and spring semesters (r = .65 [95% Cl: .31, .82], p = .001). In the fall 
semester, teachers identified an average of 5.0 friend dyads per wave of 
data collection (SD = 1.0; min = 4, max = 6). In the spring semester, 
teachers identified an average of 4.7 friend dyads per wave of data 
collection (SD = 2.5; min = 2, max = 8). Each dyad received a teacher- 
report friendship score, representing the number of times during 
a semester that two children were identified as friends. 


Friendship: Child reports 

Children identified friends from an array of head and shoulder photos that 
included all 9 children in the class: ‘Who is your friend?’ Unlimited same- 
and other-sex nominations were permitted. Reciprocated friendships 
were defined as dyads in which both partners nominated one another 
as friends. Reciprocated friendships were unstable from the beginning to 
the end of the fall semester (r = —.19 [95% Cl: —.49, .15], p = .14), and 
between assessment intervals within the fall semester (range r = —.12 to 
57 [M = -.14]), but were stable from the beginning to the end of the 
spring semester (r = .80 [95% Cl: .64, .89], p < .001) between assessment 
intervals within the spring semester (range r = .17 to .25 [M = .21]), and 
across the fall and spring semesters (r = .48 [95% Cl: —.02, .74], p = .03). In 
the fall semester, children reported an average of 7.5 reciprocated friend- 
ships per wave of data collection (SD = 3.6; min = 3, max = 12). In the 
spring semester, children reported an average of 3.3 reciprocated friend- 
ships per wave of data collection (SD = 1.3; min = 2, max = 5). Each dyad 
received a child-report friendship score, representing the number of times 
during a semester that the two children were reciprocated friends. 


Social engagement: Teacher reports 

Teachers completed two 4-item measures from the Child Behaviour Scale 
(Ladd & Profilet, 1996), rating each child’s sociability (e.g., ‘Does this child 
play with other classmates enthusiastically?) and withdrawal (e.g., ‘Does 
this child prefer to play alone?’). Items were rated ona scale ranging from 
1 (Never) to 3 (Always). Internal reliability was good (alpha = .87 to .88). 
Teacher ratings were stable from the beginning to the end of the fall 
semester (Sociability r= .91 [95% Cl: .62, .98], p = .001; Withdrawal r = .92 
[95% Cl: .66, .98], p < .001), from the beginning to the end of the spring 
semester (Sociability r= .73 [95% Cl: .13, .94], p = .01; Withdrawal r = .72 
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[95% Cl: .11, .94], p = .01), and across the fall and spring semesters 
(Sociability r = .80 [95% Cl: .29, .96], p = .01; Withdrawal r = .85 [95% Cl: 
.43, .97], p = .001). 


Social engagement: Peer report 

At the beginning and end of each semester, peer nominations of 
likeability (Who do you like?’) were obtained using a standard socio- 
metric procedure. Children identified classmates from a board contain- 
ing separate photos of each child in the class. Unlimited same- and 
other-sex nominations were permitted. In the fall semester, children 
nominated an average of 1.8 peers per wave of data collection 
(SD = 0.2; min = 1.7, max = 2). In the spring semester, children 
nominated an average of 1.6 peers per wave of data collection 
(SD = 0.6; min = 1, max = 2.2). Each participant received a score 
representing the number of nominations he or she received from 
classmates, divided by the total number of nominators. Peer reports 
of likeability were not stable from the beginning to the end of the fall 
semester (r = .07 [95% Cl: —.62, .70], p = .42), nor across the fall and 
spring semesters (r = .30 [95% Cl: —.45, .80], p = .21). Stability coeffi- 
cients from the beginning to the end of the spring semester were 
substantial, but failed to reach conventional levels of statistical signifi- 
cance (r = .46 [95% Cl: —.29, .86], p = .11). 


Plan of analysis 


Missing data accounted for an average of 7.3% (SD = 0.07; min = 0%; 
max = 17.1%) of reports for all variables included in the study. There was 
no attrition, but attendance varied. In the fall semester, children were 
present for either 3 or 4 of the observation sessions (M = 3.8, SD = 0.4). In 
the spring semester, children were present for either 2 or 3 of the 
observation sessions (M = 2.9, SD = 0.3). There were no missing teacher 
reports in either semester. Children participated in 3 or 4 (M = 3.2, 
SD = 1.2) peer nomination sessions in the fall semester and 2 or 3 (M 
= 2.6, SD = 1.0) peer nomination sessions in the spring semester. Missing 
data were imputed using the joint model imputation approach (McNeish, 
2017). 

Two sets of analyses were conducted to validate the continuous mea- 
sures of social interaction. The first analyses (N = 36 dyads) examined 
convergence between the dyadic continuous measure of social interaction 
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and (a) teacher reports of friendship and (b) child reports of reciprocated 
friendship. We conducted two-way mixed, average measures, consistency 
intraclass correlations because the variables were designed to measure 
similar constructs and because the children participated in multiple, non- 
independent dyads (McGraw & Wong, 1996). The second analyses (N = 9 
children) examined convergence between the individual continuous mea- 
sure of social interaction and (a) average teacher ratings of social engage- 
ment and (b) average peer ratings of social engagement. We selected 
Pearson’s r interclass correlations because the variables did not represent 
identical constructs and because each child was included only once in the 
data. Convergence coefficients are accompanied by confidence intervals 
and p-values (both one-tailed to reflect directional tests). Analyses were 
conducted with SPSS (Version 25.0, IBM Corp, 2017). 


Results 


Table 1 presents results for dyadic measures of affiliation (N = 36 dyads). 
The dyadic continuous measure of social interaction was positively 
correlated with teacher reports of friendship in the fall and (at borderline 
levels of statistical significance) in the spring. The dyadic continuous 
measure of social interaction was correlated (at borderline levels of 
statistical significance) with child reports of friendship in the spring 
but not in the fall. Thus, dyads with higher continuous social interaction 


Table 1. Convergence between dyadic measures of peer affiliation: within-Dyad intra- 
class correlations. 


1 2 3 M 
Variable ([95% Cl] p) ([95% Cl] p) ([95% Cl] p) (SD) 
1. Dyadic continuous 34 24 0.00 
measure of social ([-.16, c°] .06) ([-.33, ce] .10) (0.61) 
interaction 
2. Child-reported —.24 84 0.28 
Friendship ([-1.0, c°] .37) ([.71, e] .001) (0.66) 
3. Teacher-reported 50 32 0.39 
Friendship ([.12, eo] .01) ([-.19, c0] .06) (0.73) 
M 0.00 0.83 0.28 
(SD) (0.93) (0.88) (0.57) 


Note. N = 36 dyads. Fall 2017 correlations below the diagonal and spring 2018 correlations above the 
diagonal. Dyadic continuous measure scores represent the number of vocalizations one child made in 
the presence of another child, divided by the amount of time the two children were in the classroom 
together; for each dyad, scores were averaged across partners, z-standardized within waves, and 
averaged across waves. Child-report scores represent the number of times during a semester that two 
children were reciprocated friends. Teacher-report scores represent the number of times during 
a semester that teachers identified two children as friends. One sided confidence intervals (in brackets) 
and p-values are presented. 
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Table 2. Convergence between individual measures of peer affiliation: within-individual 
interclass correlations. 


1 2 3 4 M 
Variable ([95% Cl] p) ([95% Cl] p) ([95% Cl] p) ([95% Cl] p) (SD) 
1. Continuous measure 54 719 -.63 0.00 
of social interaction ([-.07, ©] .07)  ([.38, ©] .01) = ([-x», —.07] .04) (0.50) 
2. Child-reported 21 56 -55 0.26 
likeability ([—.71, 0°] .30) ([—.04, c0] .06) — ([-ee, .05] .06) (0.09) 
3. Teacher-reported 70 —.15 —.73 2.79 
sociability ([.19, co] .02) ([-.68, 00] .35) ([-ee, —.25] .01) (0.23) 
4. Teacher-reported —.79 -10 —72 1.25 
withdrawal ([-e0, —.38] .01) [-c0, 65] .40) —([-e0, —.23] .01) (0.17) 
M 0.00 0.23 2.43 1.25 
(SD) (0.60) (0.10 (0.48) (0.35) 


Note. N = 9 children. Fall 2017 correlations below the diagonal and spring 2018 correlations above the 
diagonal. Continuous measure scores were z standardized within waves and averaged across waves. 
Likeability represents the number of nominations a child received from classmates, divided by the 
number of nominators. Sociability and withdrawal were rated on a scale ranging from 1 (never) to 3 
(always), averaged across reporters. One sided confidence intervals (in brackets) and p-values are 
presented. 


scores tended to be rated as friends by teachers and children, although 
some of these associations fell short of conventional statistical signifi- 
cance. Teacher and child reports of friendship were positively correlated 
in the fall and spring. 

Table 2 presents results for individual measures of affiliation (N = 9 
children). The individual continuous measure of social interaction was 
positively correlated with child reports of likeability (spring only) and with 
teacher reports of sociability (fall and spring); the individual continuous 
measure of social interaction was negatively correlated with teacher 
reports of withdrawal (fall and spring). Thus, individuals with higher 
continuous social interaction scores were more apt to score higher on 
sociability and likeability, and score lower on withdrawal. Teacher reports 
of withdrawal and sociability were negatively correlated (fall and spring); 
neither were correlated with child reports of likeability (with the excep- 
tion of a borderline statistically significant negative correlation with with- 
drawal in the spring). 


Discussion 


In this study, we paired Ubisense proximity measures with LENA vocaliza- 
tion recordings to create continuous measures of peer affiliation that 
assay dyadic friendship and individual sociability. The continuous mea- 
sures of social interaction largely aligned with traditional indicators of 
peer affiliation, accounting for 24-50% of the variance in teacher reports 
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of dyadic friendship and 49-62% of the variance in teacher reports of 
individual social engagement. Continuous measures of social interaction 
were not aligned with child reports of affiliation in the fall, but in the 
spring they accounted for 34% of the variance in child reports of dyadic 
friendship and 29% of the variance in child reports of individual likeability. 

Continuous measures of social interaction hold important advantages 
over traditional measures of friendship and sociability: they are unobtru- 
sive, they simultaneously assess all children in a classroom, and they have 
fewer sources of error. Child reports, in particular, were plagued by low 
stability, especially in the fall semester when some of the participants 
were only 2 years old, which is at the low end of successful administration 
of the picture sociometric task (Hymel, 1983). Some might object to the 
fact that continuous measures of social interaction operationalize friend- 
ship and social engagement in purely objective terms, and they would 
have a point; continuous measures fail to capture the affective dimension 
contained in questions about friendship. Of course, much depends on 
how friendship is construed. Young children place a premium on compa- 
nionship in friendships (Laursen & Hartup, 2002), suggesting that objec- 
tive measures conceptually align with self-reports during this age period. 

Scholars considering replacing expert observers with continuous mea- 
sures of social interaction may be concerned about the expense. LENA 
and Ubisense carry considerable set-up costs (several thousand USD at 
the time of publication) that have no parallel in expert observations. But 
the more data that one collects, the more the cost differential narrows, 
because expert observations are more labour-intensive than continuous 
measures of social interaction, during data collection and afterwards. 
Note also that the cost of the equipment should decrease (as is often 
the case with new technology), whereas the cost of the labour required 
for expert observations is apt to increase. Aside from costs, we see few 
other drawbacks to replacing expert observers and teacher reports with 
continuous measures of social interaction. Observer reports contain error, 
on the part of the observer and on the part of the coder. The technology 
described herein is not error-free; children are not always paying atten- 
tion when others in close proximity speak to them, but expert observers 
are not immune to these errors either and are inclined to make other 
errors. Expert observations are also limited in terms of the amount of data 
that can be collected. Teacher reports of companionship, although inex- 
pensive to obtain, carry few advantages over technological assessments. 
It is possible to deploy this technology in larger classrooms with more 
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students, where information demands are apt to overload teacher abil- 
ities to accurately report how much time each child spends in the com- 
pany of each other child, and confirmation bias (recalling answers from 
previous reporting periods) may give the illusion that teachers are reliable 
when in fact they may be missing subtle changes in social interaction 
patterns. Of course, if the researcher's goal is simply to measure the 
friendships of very young children, it may be difficult to justify the 
expense of continuous measurements. In these circumstances, our results 
should serve to remind scholars that child reports are less stable than 
teacher reports and that the two only weakly align; it may be prudent to 
collect information from both sources because they may capture some- 
what different constructs. 

Should continuous measures of child interaction replace child reports 
of friendship and likeability? We think not, because peer nominations 
reflect attitudes and affection, components of relationships and status 
that cannot be captured through the technologies described here. 

Our study is not without limitations. Although the teacher and child 
reports provide converging evidence of the validity of the continuous 
measures of interaction, our case would be stronger had we also included 
observational data. It is important to note that sensor data were only 
collected in the classroom; we would expect greater convergence with 
teacher and peer reports had we also collected playground data. It is 
essential to replicate the current results with larger samples, which will 
produce more stable parameter estimates and improve the overlap 
between objective measurement, teacher ratings, and child reports. Small 
samples limit the generalizability of the results, as does our use of an oral 
language inclusion classroom. It is worth noting, however, that these 
limitations should increase error variance, thus decreasing convergence 
across measures. More classrooms with more verbally proficient students 
could yield greater convergence. The majority of communication in the 
classroom was in English, but not all participants came from families who 
spoke English at home. This may have contributed to the low reliability of 
the sociometric task, particularly in the fall. Typically, intraclass correlations 
are not negative. The (nonsignificant) negative value we obtained with 
child-reports of friendship implies an excess of within-group error variabil- 
ity. Finally, it is possible that some speech made in the proximity of 
a classmate was not heard or attended to by that classmate or was not 
directed to that classmate. As a consequence, reciprocated interaction may 
have been over-estimated, especially among the highly verbal. By the same 
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token, verbosity might increase the chances that the socially central are 
erroneously identified as friends; it should not, however, adversely impact 
assessments of sociability or likeability. 

Empirical assessments of peer affiliation have changed little in the past 
century, despite their well-documented limitations. The opportunity for 
a technological upgrade is at hand. Both friendship and sociability can be 
measured objectively with new tools that continuously assess location 
and speech. As these tools become more affordable and widespread, 
scholars of peer relationships should not hesitate to adapt them to new 
purposes. 
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Appendix 


Left: Child wearing vest outfitted with LENA recorder (front pocket). Right: Backside of 
vest with pockets for Ubisense tags (left and right). 


